Exploring uplift modeling with high class imbalance
نویسندگان
چکیده
Abstract Uplift modeling refers to individual level causal inference. Existing research on the topic ignores one prevalent and important aspect: high class imbalance. For instance in online environments uplift is used optimally target ads discounts, but very few users ever end up clicking an ad or buying. One common approach deal with imbalance classification by undersampling dataset. In this work, we show how can be extended modeling. We propose four methods for compare proposed empirically when some have a tendency break down. key observation that accounting particularly random forests, which explains poor performance of model earlier works. Undersampling also crucial class-variable transformation based models.
منابع مشابه
Pessimistic Uplift Modeling
Uplift modeling is a machine learning technique that aims to model treatment effects heterogeneity. It has been used in business and health sectors to predict the effect of a specific action on a given individual. Despite its advantages, uplift models show high sensitivity to noise and disturbance, which leads to unreliable results. In this paper we show different approaches to address the prob...
متن کاملUplift Modeling in Direct Marketing
Marketing campaigns directed to randomly selected customers often generate huge costs and a weak response. Moreover, such campaigns tend to unnecessarily annoy customers and make them less likely to answer to future communications. Precise targeting of marketing actions can potentially results in a greater return on investment. Usually, response models are used to select good targets. They aim ...
متن کاملClassification with class imbalance problem: A Review
Most existing classification approaches assume the underlying training set is evenly distributed. In class imbalanced classification, the training set for one class (majority) far surpassed the training set of the other class (minority), in which, the minority class is often the more interesting class. In this paper, we review the issues that come with learning from imbalanced class data sets a...
متن کاملDealing with Class Imbalance using Thresholding
We propose thresholding as an approach to deal with class imbalance. We define the concept of thresholding as a process of determining a decision boundary in the presence of a tunable parameter. The threshold is the maximum value of this tunable parameter where the conditions of a certain decision are satisfied. We show that thresholding is applicable not only for linear classifiers but also fo...
متن کاملClass Imbalance Learning
This report presents the work completed since the thesis proposal and the revised plan for the future PhD study. Two main issues have been discussed so far: diversity analysis of ensemble models in class imbalance learning, exploration of negative correlation learning on imbalanced data. Experimental design and main conclusions are simply described. More details are included in the two papers i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Data Mining and Knowledge Discovery
سال: 2023
ISSN: ['1573-756X', '1384-5810']
DOI: https://doi.org/10.1007/s10618-023-00917-9